Limits of Adaptive Optics for high contrast imaging 
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ABSTRACT 

The effects of photon noise, aliasing, wavefront chromaticity and scintillation on the point 
spread function (PSF) contrast achievable with ground based adaptive optics (AO) are evaluated 
for different wavefront sensing schemes. I show that a wavefront sensor (WFS) based upon the 
Zernike phase contrast technique offers the best sensitivity to photon noise at all spatial frequen- 
cies, while the Shack-Hartmann WFS is significantly less sensitive. In AO systems performing 
wavefront sensing in the visible and scientific imaging in the near-IR, the PSF contrast limit 
is set by the scintillation chromaticity induced by Fresnel propagation through the atmosphere. 
On a 8m telescope, the PSF contrast is then limited to le-4 to le-5 in the central arcsecond. 
Wavefront sensing and scientific imaging should therefore be done at the same wavelength, in 
which case, on bright sources, PSF contrasts between le-6 and le-7 can be achieved within 1 
arcsecond on a 8m telescope in optical/near- IR. The impact of atmospheric turbulence param- 
eters (seeing, wind speed, turbulence profile) on the PSF contrast is quantified. I show that a 
focal plane wavefront sensing scheme offers unique advantages, and I discuss how to implement 
it. Coronagraphic options are also briefly discussed. 



Subject headings: instrumentation: adaptive optics — instrumentation: interferometers 
data analysis — techniques: interferometric — techniques: high angular resolution 



methods: 



1. Introduction 

High contrast imaging of the immediate en- 
vironment (within a few astronomical units) of 
nearby stars is critical to the understanding of for- 
mation and evolution of planetary systems. The 
ultimate goal of planetary systems studies is to 
find and characterize planets similar to ours, in the 
hope that we can find other "habitable" words sus- 
ceptible of harboring life. Several approaches are 
currently under development to achieve the high 
level of contrast required : 

• Nulling interferometry in the mid-IR with a 
30m baseline space interferometer. 

• Visible coronagraphy with a 4m to 8m space 
telescope. 

• Large ground based telescopes (8m to 100m) 
and high performance AO systems optimized 
for bright targets. 



While the first 2 options are targeting Earth-size 
planets around nearby stars, the ground based sys- 
tems have more modest initial goals: planets more 
massive than Jupiter or young Jupiter mass plan- 
ets. The plans to build larger (30m to 100m) tele- 
scopes and the fast progress in high-performance 
AO systems and coronagraphy however opens up 
the possibility of pursuing more ambitious goals. 
Direct imaging of Earth-size planets is even con- 
sidered for 100m diameter telescopes (Gilmozzi 
2004; Hawarden et al. 2003). 

The contrast detection limit within a PSF is set 
by photon noise and speckle noise in the image. If 
only photon noise is considered the theoretical de- 
tection limit for a large telescope (30m to 100m) 
allows relatively easy detection of Earth-size plan- 
ets around nearby stars (Angel 2003; Hawarden et 
al. 2003). This however seems to be a very opti- 
mistic assumption, as current AO systems are all 
limited by speckle noise (Racine et al. 1999) within 



1 



the central arcsecond. The detection limit is there- 
fore likely to be driven by how well the speck- 
les can be calibrated/removed. Fast atmospheric 
speckles average fairly rapidly, but slower-evolving 
speckles are more problematic. Experience ac- 
quired on ground-based telescope has shown that 
there is no such thing as a truly static speckles, as 
extremely small drifts in the wavefront are suffi- 
cient to appreciably change the speckle's intensity: 
for a speckle of intensity 10~ 5 the central star's in- 
tensity to be stable to within 1%, the correspond- 
ing spatial frequency would need to be stable to 
2.5 10 -6 wave (4 picometers at 1.6 /mi). 

In non-coronagraphic imaging with current AO 
systems on large telescopes, the PSF wings are 
relatively smooth in long exposures: about 10% 
of the speckle background does not average at 
about 1" (figure 1 in Boccaletti et al. (2003)). 
Through careful PSF calibration, some of the 
residual speckle structure can be subtracted (Roth 
ct al. 2001), yielding a point source detection limit 
more than 10 times fainter than the PSF back- 
ground level within the central arcsecond. This 
factor tends to become larger with increasing dis- 
tance from the PSF core, partially thanks to the 
chromatic elongation of speckles which makes the 
PSF smoother. Differential imaging techniques 
based on spectral properties of the source could 
further increase this factor by up to 10 2 for simul- 
taneous imaging in 2 bands and 10 4 for simultane- 
ous imaging in 3 bands (Marois et al. 2000) . Tech- 
niques using the coherence properties of speckles 
have also been proposed (Boccaletti et al. 1998; 
Guyon 2004). 

In this study, performance of an adaptive op- 
tics system is quantified by the ratio between the 
light intensity at the point of the PSF considered 
and the light intensity at the PSF's center. This 
quantity is referred to as the PSF contrast in the 
rest of the paper. The goal of this work is to give 
limits on this PSF contrast achievable with AO 
systems, and to propose solutions to reach these 
limits: which WFS to choose ? how to drive the 
dcformable mirror (DM) ? is a coronagraph nec- 
essary ? if yes, which one ? Detection limits for 
faint companions are significantly harder to pre- 
dict than PSF contrast for the reasons detailed 
above, and will not be computed in this paper. 

The PSF contrast in a photon-noise limited AO 
system is a function of the ability of the WFS to 



accurately measure the corresponding spatial fre- 
quency in the pupil plane phase. Analytical ex- 
pressions of the fundamental contrast limits im- 
posed by photon noise and chromaticity of the 
wavefront are derived is §2. Aliasing effects are 
discussed in §3, and solutions to reduce their im- 
pact on the PSF contrast are proposed. In §4, the 
sensitivity of common WFSs to photon noise is 
discussed, and a WFS based upon Zernike's phase 
contrast is shown to offer optimal sensitivity. Re- 
sults of §2, §3 and §4 are combined and discussed 
in §5 to derive realistic limits to the PSF contrast 
in ground-based AO systems and identify optimal 
approaches to detect extrasolar planets. §5 shows 
that a focal plane WFS can be especially advanta- 
geous, and this option is discussed in more detail 
in §6. In §7, I discuss the need for coronagraphy 
and the choice of the correct coronagraph to reach 
the PSF contrast derived in this study. 

2. Fundamental limits of wavefront sens- 
ing for high contrast AO: analytical ex- 
pressions 

2.1. Speckles and wavefronts 

Notations used in this work are given in table 
1. In this paper, I consider a "perfect" corona- 
graph: if the entrance pupil of the system had no 
phase aberrations, the focal plane light intensity 
would be equal to outside the Inner Working 
Angle (smallest angular separation at which the 
coronagraph can be used to detect a faint compan- 
ion, denoted IWA in the rest of the paper) of the 
coronagraph within the spectral bandwidth con- 
sidered. All observations are made at the zenith: 
atmospheric dispersion is not taken into account. 

The pupil plane complex amplitude is denoted 

W{u) = A(u) e^ (s) (1) 

where A{u) is the amplitude and 4>{u) the phase of 
the wavefront. The pupil plane phase aberration 

cf>(u) = ^ cos (2k fu + ij (2) 

creates 2 symmetric images of the central PSF 
(Malbet et al. 1995): 

7(a) = PSF(a)+ (^) [PSF(a+f\)+PSF(a-f\)} 
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where A is the imaging wavelength, h <C A is the 
amplitude (in meter) of the sine wave phase aber- 
ration of spatial frequency /, and a — /A is the 
angular coordinate on the sky. The phase of these 
speckles are ir/2 — 9 and tt/2 + 0. Similarly a 
multiplicative amplitude error 

A(u) = l + a cos (2tt/u + 6») (4) 

creates 2 symmetric speckles of phases —9 and 9 
on either side of the "ideal" image PSF (a): 

1(a) = PSF(a)+(^y [PSF(d+f\)+PSF(d-fX)}. 

(5) 

To simplify notations in the rest of the paper, I 
will denote / = |/|, u = \u\ and a = \a\. 

If a perfect DM is used, the ability of the AO 
system to suppress light at the position a is then 
given by the ability of the WFS to measure the 
phase and amplitude of the corresponding spatial 
frequency in the pupil plane phase. In an opti- 
cally "perfect" system (no wavefront or amplitude 
errors introduced by the optical elements, noiseless 
detector), the effects that limit the performance of 
WFSs for high contrast imaging are: 

• Photon Noise in the WFS. 

• Chromaticity of the optical pathlcngth dif- 
ference (OPD) and amplitude between the 
WFS wavelength A and the imaging wave- 
length Xi. 

• Aliasing. The wavefront measurement is 
corrupted by higher spatial frequency aber- 
rations that propagate into modes which are 
detected by the WFS. 

In this section, the first two effects are discussed, 
while aliasing effects are studied separately in §3. 
Table 2 lists the terms computed analytically in 
this section: 

• Co: PSF contrast limit imposed by OPD 
aberrations in uncorrected atmospheric tur- 
bulence. 

• C\\ PSF contrast limit imposed by ampli- 
tude aberrations in uncorrected atmospheric 
turbulence (scintillation) . 



• C2: PSF contrast limit imposed by residual 
OPD aberrations after AO correction. This 
term is computed analytically in this section 
as a function of the WFS sensitivity to pho- 
ton noise (i p . Using the technique proposed 
in appendix A, j3 p is computed for different 
WFSs in §4. 

• C3: PSF contrast limit imposed by residual 
amplitude aberrations after AO correction of 
OPD and amplitude. 

• C4: PSF contrast limit imposed by the dif- 
ferential OPD between the WFS and imag- 
ing wavelengths. This term is caused by the 
chromaticity of Fresnel propagation. 

• C5: PSF contrast limit imposed by the dif- 
ferential scintillation between the WFS and 
imaging wavelengths. This term is caused 
by the chromaticity of Fresnel propagation. 

• Cq : PSF contrast limit imposed by the dif- 
ferential OPD between the WFS and imag- 
ing wavelengths. This term is caused by the 
chromaticity of the refraction index of air. 

The terms C, are computed for high levels of cor- 
rection (Strehl ratio m 1), which allows simplifica- 
tion of most equations. The final PSF contrast C, 
computed as a function of angular separation, is 
then obtained as follows: 

• No AO correction: C = Co + C\. 

• AO correction of phase only: C = C\ + 

C2 + C4 + C§. 

• AO correction of phase and amplitude: 

C = C2 + C3 + C4 + C5 + 6*6 . 

2.2. Uncorrected atmospheric turbulence 

(C and Ci) 

In the paraxial approximation, Fresnel propa- 
gation of wave of complex amplitude W(u, 0) over 
a distance z produces a wave W(u, z) described by 

W(u, z) = W{u, 0) ® exp(iiru 2 /zX) (6) 

where ® is the convolution operator. This is equiv- 
alent to a phase shift of each spatial frequency 
component of the wavefront by 

d<t> = 7T f z X. (7) 
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Table 1 
Notations and units 





symbol 


unit 


Telescope diameter 


D 


m 


Detection wavelength 


Ai 


m 


WFS wavelength 


A 


m 


Seeing wavelength 


A 


m 


Fried parameter 


n> at A 


m 


Wind speed 


V 


m.s -1 


Source brightness 


F 


ph.s _1 .m 


Angular separation 


a 


rad 


Turbulence altitude 


z 


m 


Turbulence profile 


Cl{z) 




WFS sensitivity to OPD 


P P 




WFS sensitivity to amplitude 


Pa 





Table 2 

Contributions to the PSF contrast 



Uncorrected atmospheric OPD Co 

Uncorrected atmospheric amplitude C\ 

Residual atmospheric OPD after correction C2 

Residual atmospheric amplitude after correction C3 

OPD chromaticity (Fresnel propagation) C4 

Scintillation chromaticity (Fresnel propagation) C5 

Refraction index chromaticity Cq 
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A pupil plane complex amplitude 
27rft 



W(u, 0) = 1 + i-^- sin (2iruf + (?) (8) 



therefore becomes 



2tt/i 



W(u, z) = 1 + sin(d</>) x — ^- sin ylnuf + 

+i cos(d0) x sin ^2ttu/ + ffj . (9) 

This equation shows that Fresnel propagation of 
a pure sine wave phase aberration produces both 
an amplitude and a phase aberration of identi- 
cal spatial frequency in the pupil plane. This 
effect is periodic for each spatial frequency, as 
W(u, z + z T ) = W(u, z) with z T = 2// 2 A the Tal- 
bot distance (Talbot 1836). 

For ground-layer Komogorov atmospheric tur- 
bulence, the power spectrum of the 2D phase is 



<£(/) 



0.023 



'•o 



5/3 



-11/3 



(10) 



where r is the Fried parameter. In a telescope 
pupil of diameter D, the power given by a sin- 
gle spatial frequency is obtained by integration of 
<f>(f) over a 2D domain of width proportional to 
1/D. Through numerical simulations, the corre- 
sponding amplitude (in meter) of the sine-wave 
component of spatial frequency / is computed: 



, m= 0-22 Aq 

n v> fii/e Dro 5/6- 



(11) 



where Ao is the wavelength at which r is mea- 
sured. 

Taking into account the Fresnel propagation 
given in equation 9, the following expressions are 
obtained for the OPD and amplitude components 
of atmospheric turbulence in equations 2 and 4: 



HI) 

a(f) = 



0.22 A 



/H/6 £) ro 5/6 

2tt 0.22 A 
Xi /u/6Dro 5 / 6 



y/X(f,Xi) (12) 
VnJJT) (13) 



where 



y, f y, JC^COS^zfXJdz 



and 

Y(fx] SCl{z)W(.zP\ t )dz 
Y[LXi) JCUz)dz i-Mf'W- 

(15) 

Since Fresnel diffraction is chromatic, X and Y are 
function of \ . X is the fraction of the atmospheric 
turbulence which produces phase errors, the re- 
maining part producing amplitude errors (scintil- 
lation). For low altitude turbulence and/or low 
spatial frequencies, X«l: the beam propagation 
length is too short to allow Fresnel diffraction to 
transform phase errors in amplitude errors. By 
combining equations 3 and 12, since / = a/ Xi, at- 
mospheric phase aberrations produce the following 
contrast at A;: 



C (a) = 



0.484 tt 2 V X^ /6 Xja/X^X,) 

a ll/3 D 2 rf) 5/3 



(16) 



Similarly, from equations 5 and 13, the amplitude 
aberrations (scintillation) produce the following 
contrast at Aj: 

nf , 0-484 ir 2 X 2 X^ 3 Y(a/X t ,X t ) 

Cl(a) = a lV3^ ro 5/3 • 

Since X + Y = 1, the combined contribution of 
phase and amplitude aberrations in the PSF con- 
trast is independent of the turbulence altitude. 

2.3. Effect of WFS photon noise and time 
lag on the corrected phase (C2) 

In the Taylor approximation used in this work, 
atmospheric turbulence is moving in front of the 
telescope pupil at a speed v (wind speed along 
the direction a considered). In a closed- loop AO 
system, the corrected amplitude h c of the spatial 
frequency considered is quadratic sum of a com- 
ponent due to time lag and a component due to 
photon noise (given by equation. A23) : 



he 



\ 



^\Jt F it _D 2 /4 j 
(18) 

where t is the WFS sampling time, F is the source 
brightness (in ph.s -1 .m~ 2 ) and D is the telescope 
diameter. 

h c is minimal for 

, = /A\ 2/3 0.204 ^ /3 r 5 / 9 / 5 / 9 

th [XoJ F^ v 2/s X (a/X i ,X i )^ ^> 
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The corresponding residual error produces a 
symmetric pair of speckles (equation 3) with a con- 
trast to the central PSF peak: 



C 2 {a) = 0.7475 



A 4 / 3 Aq 2 / 3 ^ /3 t> 2 / 3 X(a/\j, A;) 1 / 3 

A .13/9 F 2/3 D 2 ro 5/9 a 5/9 

(20) 

where Aj is the wavelength at which the final image 
is obtained, and might be different from A, the 
wavefront sensing wavelength. 

2.4. Effect of WFS photon noise and time 
lag on the corrected amplitude (C3) 

The light intensity distribution (scintillation) in 
the pupil plane can be measured and corrected for. 
In the frozen turbulence flow model adopted in this 
work, scintillation is an amplitude screen moving 
in front of the telescope. The effect of photon noise 
and time lag on the corrected amplitude a c can 
therefore be written 



'(27ra(j>i/) 2 + 



t F n D 2 /4 ' 



(21) 



This equation is identical to equation 18 if h c and 
h(f) are replaced by Xia c /2n and Xia(f)/2n re- 
spectively. The optimal sampling time is therefore 



to = 



A 



2/3 



0.204 /? 2/3 r 5 / 9 / 5 /9 



(22) 



X Q J Fl/3„2/3y( a y A . A .)l/3' 

and the corresponding contrast C3 is 

.A^Ao^/V/ay^/A.^i/s 



C 3 (a) = 0.7475- 



Aj 13/9 p2/3 D 2 a 5/9 r 5/9 



(23) 



2.5. Chromaticity of OPD and scintilla- 
tion 

2.5.1. OPD chromaticity produced by Fresnel 
propagation (C4) 

Fresnel propagation is chromatic, and the OPD 
at the telescope pupil is therefore chromatic. 
When perfectly corrected at one wavelength (the 
WFS wavelength), the OPD in the imaging wave- 
length will show a small residual which limits the 
achievable contrast to: 

„ , , Co (a) dX(a/\j, \j, A) 

Ci{a) = — xRu) — (24) 



where 



dX(f, A,, A) = 



/C 2 (z) (cos(nzf 2 X l ) - cos(7rz/ 2 A)) 2 dz 



JCUz)dz 



(25) 



2.5.2. Scintillation chromaticity (C5) 



Similarly, Fresnel propagation produces wavelength- 
dependent intensity variations in the pupil plane. 
This produces a limit C5 on the achievable con- 
trast : 

C 5 (a) = giMfj"/^^), (26) 



where 



dY(f,Xi,X) 



JCl(z) (sin( 7 rz/ 2 A l ) - sin(7rz/ 2 A)) 2 dz 



ICl{z)dz 



(27) 



2.5.3. Chromaticity of the air refraction index 
(C 6 ) 

The index of refraction of dry air at stan- 
dard temperature and pressure is wavelength- 
dependent (Edlen 1966): 



n(A) = 1.0+8.3421310~ 5 + 



0.0240603 0.00015997 
; + -. 



130 -A- 2 38.9 -A- 2 
(28) 

The corresponding PSF contrast is 



C 6 (a)=C (a) 



n(Xj)-n(X) 
l-n(Xi) 



(29) 



3. Aliasing effects 
3.1. WFS aliasing 

The pupil OPD and amplitude aberrations can 
only be corrected by the AO system below a cutoff 
spatial frequency / c , because of limited sampling 
in the pupil plane DM and/or in the WFS. For 
the contrast expressions derived in §2 to be appli- 
cable, the measurement accuracy of a pupil plane 
phase aberration of spatial frequency / < f c at 
the WFS wavelength A must be limited by photon 
noise. Unfortunately, measurement of an OPD or 
amplitude aberration of frequency f < f c , even in 
the absence of photon noise, can be corrupted by 
aliasing: other spatial frequencies (usually above 
f c , but not always) can create a WFS signal at 
frequency /. 
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The optical part of the WFS (before the detec- 
tor) docs not produce aliasing: a phase aberra- 
tion at frequency / only creates an optical signal 
of frequency / in the pupil plane. This signal can 
be spot displacements (for Shack-Hartmann WFS) 
or intensity modulation (for curvature WFS for 
example). Aliasing is therefore produced by the 
limited sampling of the measurement in the pupil 
plane. 

Two approaches exist to suppress or mitigate 
aliasing in WFSs: 

• Increasing the WFS spatial sampling. 

If a good detector (low readout noise and 
dark current) is used, this solution can be 
highly successful for all but one of the WFS 
considered in this work. The single excep- 
tion is the Shack-Hartmann WFS, where 
an increase of spatial sampling in the pupil 
plane (smaller subapertures) increases the 
measurement error on low-order modes due 
to photon- noise (see section §4.1). 

• Preventing spatial frequencies above f c 
to be "seen" by the optical part of the 
WFS. This can be done by spatial filter- 
ing in the focal plane (Poyneer & Macintosh 
2004) or by using an anti-aliasing optical fil- 
ter before the detector (Takato 2005). These 
solutions reduce aliasing on all pupil-plane 
WFSs, and are most effective if the WFS 
sampling is regular, as is usually the case. 
Curvature WFSs include a focal plane iris 
at the vibrating membrane (usually to re- 
duce stray light and sky background), which 
can be used to reduce aliasing in high-Strchl 
regime. Anti-aliasing optical filters are rou- 
tinely used in imaging with CCDs, and are 
often placed immediately before the detector 
in commercial digital cameras. Anti-aliasing 
optical filters with total rejection of high 
spatial frequencies can be designed (Leger 
ct al. 1997) for monochromatic light, and 
their performance in white light is still good: 
the solution proposed by Leger et al. (1997) 
reduces aliasing by a factor 24 with a 20% 
bandpass. 



3.2. Algorithm used to compute aliasing- 
free DM control signals 

If the focal plane complex amplitude (or, equiv- 
alently, the aliasing-free pupil plane complex am- 
plitude) is perfectly known up to a spatial fre- 
quency f c , it is possible to drive a DM to cancel 
focal plane speckles within a region of the image 
corresponding to spatial frequencies lower than 
f c . Malbet et al. (1995) proposed to use a non- 
linear minimization algorithm to control the DM. 
In this work, a Gershberg-Saxton algorithm (Ger- 
chberg & Saxton 1972) is proposed to find the op- 
timal DM control signals. Since this method is 
tailored at finding the solution of a problem with 
constraints on both a function (pupil plane com- 
plex amplitude) and its Fourier transform (focal 
plane complex amplitude), it seems naturally well 
adapted (Faucherre et al. 1989). The steps of the 
algorithm arc shown in figure 1: 

• (1) The region of the focal plane within 
which the speckles are to be canceled is first 
chosen. This "diffraction control domain" 
(DCD) should exclude the central part of 
the PSF and should not extend beyond the 
pupil spatial frequency defined by the DM 
actuator size or WFS sampling. This region 
can be within a half plane if amplitude er- 
rors (scintillation) in the pupil plane are ex- 
pected, or can include both sides of the PSF 
for correction of OPD aberrations only. 

• (2) The focal plane complex amplitude is 
multiplied by the DCD to represent the com- 
plex amplitude that should be canceled by 
the DM. 

• (3) The 2D complex function computed in 
step (2) is Fourier transformed to produce 
the "ideal" pupil plane complex amplitude 
required to cancel the speckles within the 
DCD. 

• (4) The pupil plane complex amplitude func- 
tion is "projected" on the DM: for each ac- 
tuator of the DM, the phase which best 
matches the function computed in step (3) 
is used to update the DM state. 

• (5) Using the updated DM state and the 
initial measured focal plane complex am- 
plitude, the updated complex amplitude in 
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the focal plane is numerically estimated, and 
steps (2) to (5) can then be repeated. 



This loop is 
repeated 
several times 










> 








initial pupil phase 



residual pupil phase 



focal plane image 



Fig. 1. — - Proposed algorithm used to drive the 
DM in a closed-loop AO system. 

This algorithm converges very rapidly: only a 
few iterations are required to obtain a high con- 
trast (about 10~ 10 ) if the initial PSF aberrations 
are low. It is also computationally less greedy than 
the solution proposed by Malbet et al. (1995) (the 
computing time is dominated by 2 Fourier trans- 
forms), and can therefore be implemented in a fast 
closed loop control system. 

This algorithm is also very flexible and per- 
forms well in non-ideal conditions. Non-ideal DM 
characteristics such as irregular actuator shapes, 
"dead" actuators or coupled influence functions 
can easily be included in the algorithm (step (4)) 
with minimal cost in complexity or computation 
time. For example, figure 2 shows that a solution 
yielding good PSF contrast can be found even if 
the PSF has large aberrations and if the DM has 
insufficient stroke to fully correct them. This par- 
ticular example illustrates how the algorithm is 
able to find a solution for which the diffraction 




Initial conditions 










Result after convergence 









Fig. 2. — Example of diffracted light suppression 
using the algorithm detailed in §3.2. In this case, 
the DM stroke is limited to ±0.4 radian (±0.8 
radian of phase correction), and the initial pupil 
phase aberration is about 3 radian from peak to 
peak. The DM actuators are square-shapes, and 
their influence functions are overlapping (each DM 
influence function is a square convolved by a Gaus- 
sian). In this example, there are 50 actuators 
across the diameter of the pupil (2000 actuators 
total). 



within the DCD is canceled even though the resid- 
ual phase aberrations in the pupil plane are still 
large: these aberration are confined to either high 
or low spatial frequencies, but are very small in 
the spatial frequency range corresponding to the 
DCD. It should be noted that in such non-ideal 
conditions, the number of iterations (steps (2) to 
(5)) required to converge can be quite high (about 
100 in this example). 

3.3. Closed loop operation 

In a closed loop AO system, the DM control 
algorithm proposed in §3.2 is used to compute fre- 
quent but small updates of the DM: the phase 
function in the pupil plane needed to cancel the 
speckles is very small. The number of iterations 
(steps (2) to (5) of the algorithm) required within 
the algorithm is therefore small (a single itera- 
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tion is sufficient). The computing time can con- 
sequently be made compatible with kHz update 
rate on modern computers for systems with up 
to fO 4 actuators: on a f 28xf 28 actuators system 
(13000 actuators on the circular pupil) with a fo- 
cal plane image sampling such that the DCD oc- 
cupies 256x256 pixels, the time required for the 2 
Fast Fourier Transforms is about 1ms on a modern 
computer. 

The number of photons per focal plane speckle 
is typically less than 10 per sampling period, and 
the corresponding relative error on the measured 
complex amplitude of the speckle due to photon 
noise is more than 10%. The closed loop perfor- 
mance of the AO correction is therefore not sen- 
sitive to small errors introduced by the algorithm 
described in §3.2. Even a 5% error in the knowl- 
edge of the DM response has a negligible impact 
on the system performance. 

4. Wavefront sensors sensitivities 

The sensitivity of a WFS is a quantitative mea- 
sure of how photon noise affects its measurement 
of OPD or amplitude. In this section, the sensitiv- 
ity P p of WFSs for OPD sensing (when only OPD 
is measured by the WFS) is computed. /3 P is used 
in equation 20 to estimate the contribution Ci of 
WFS photon noise to the PSF contrast. For each 
WFS, I show how Ci varies as a function of angu- 
lar separation, and how WFS design parameters 
affect it. An exact definition of (3 P and details on 
how it is computed are given in appendix A. The 
sensitivity j3 a for amplitude sensing is given within 
the discussion in §4.7. 

The results obtained in this section are only 
valid for small residual phase variance at the wave- 
front sensing wavelength. 

4.1. Shack-Hartmann WFS 

In a Shack-Hartmann WFS (SHWFS), the 
quantities Ik measured are spot displacements. 
The associated noises for a diffraction-limited 
spot, in the absence of background light, are 
(Hardy 1998) 



0.277 A 

den \/ N an 



(30) 



for a continuous noiseless detector and 
0.500 A 



(31) 



for a quad cell detector. In the above equations, 
d sa is the subaperture size and N sa is the number 
of photons per subaperture. To account for atmo- 
spheric turbulence within each cell, l/d sa should 

be replaced by \Jl/d sa 2 + l/r 2 . 

Using the equations detailed in appendix A, the 
following results are obtained: 

• For a Shack-Hartmann WFS with a contin- 
uous noiseless detector: 




(32) 



For a Shack-Hartmann WFS with noiseless 
quad cells: 




(33) 



where d sa is the subaperture size. For both equa- 
tions, / d sa < 1/3 (minimum of 3 lcnslet per sine 
wave period), as lower pupil plane sampling in- 
crease Pp. For example, with 2 lcnslets per period, 
if the center of lenslets coincide with the crests and 
peaks of the sine wave phase aberration, no signal 
will be produced by the SHWFS. 

With a SHWFS, photometry of the spots can 
be used to measure amplitude variations in the 
pupil plane, without altering the accuracy of the 
phase measurement: the sensitivity (3 P is main- 
tained even if OPD and scintillation are measured 
simultaneously. 

The PSF contrast component C2 achievable 
with a SHWFS is shown in figure 3 for subaper- 
ture sizes ranging from 2cm to 70cm. for each 
subaperture, a continuous noiseless detector was 
assumed, rather than a less sensitive quad-cell. In 
the inner region of the PSF, the contrast C2 de- 
creases as the -17/9 power of angular separation 
(equations 32 and 20). No correction is possible 
beyond the sampling limit of the WFS: the con- 
trast C2 reaches a minimum value at this tran- 
sition point. The constrast C 2 at small angular 
separations is independant of the number of sub- 
apertures if the subaperture size is larger than the 
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are taken per spatial period: 



Xi 1e-05 - 



70cm 



40cm 



20cm 



separation (arcsecond) 




Fig. 3. — PSF contrast component Ci with Shack- 
Hartmann WFS using various subaperture sizes. 
The continuous line shows the PSF contrast com- 
ponent Co without correction of the atmospheric 
turbulence. Parameters used for this simulation 
are listed in table 4. 

seeing. However, if subapertures are smaller than 
ro, diffraction by each subaperture increases the 
subaperture's focal plane spot size and therefore 
reduces the sensitivity of the WFS. It therefore 
seems impossible to simultaneously optimize the 
contrast over a wide range of separations. 

To achieve the optimal performance shown in 
figure 3, the wavefront integration time th needs 
to be proportionnal to a -1 / 9 . 

4.2. Curvature wavefront sensor (CWFS) 

In a curvature WFS (Roddier 1988; Roddier et 
al. 1991), a spherical phase aberration is intro- 
duced in the focal plane, which is equivalent, in the 
pupil plane, to Fresnel propagation. The pupil im- 
age is therefore "conjugated" to an altitude which 
is set by the amplitude of the focal plane phase 
aberration. Equation 9 shows that Fresnel propa- 
gation of a pure sine wave phase aberration pro- 
duces both an amplitude and a phase aberration of 
identical spatial frequency in the pupil plane. The 
curvature WFS therefore transforms phase aberra- 
tions into light intensity modulations in the pupil 
plane. 

The WFS measures intensities Ik in the pupil 
plane. I assume here that N such measurements 



h = 



Nph 
N 



1 



4 7T A sin(d0) 
A 



/2nk 
Sm {— 



(34) 

with a Ik = N ph /N. 

Using the method detailed in appendix A, the 
following expression for (3 p is obtained: 

' ir Sz A a 2 ' 



/3 p (a) = sin 1 



A; 



(35) 



where Sz is the conjugation altitude of the pupil 
plane (in curvature AO systems, 2 pupil plane im- 
ages are usually acquired, at conjugation altitudes 
+Sz and — Sz). This result is independant of N 
for N > 2. 

The PSF contrast component C2 achievable 
with a CWFS is shown in figure 4. The amount of 
defocus introduced in the focal plane of the CWFS 
can be adjusted to tune its sensitivity to an opti- 
mal spatial frequency in the pupil plane (for which 
the term in the sine of equation 35 is n/2). In 

8m telescope, CWFS, mv=5 star 




1600km 



400km 



25km 



separation (arcsecond) 

Fig. 4. — PSF contrast component C2 obtained 
with a curvature WFS using dcfocalization dis- 
tances ranging from 25km to 1600km. Parameters 
used for this simulation are listed in table 4. 

the inner regions of the PSF, the contrast C2 de- 
creases as the -29/9 power of the angular separa- 
tion (equations 35 and 20), which is significantly 
steeper than for a SHWFS. This steep increase of 
wavefront error at low spatial frequencies is also 
referred to as "noise propagation", and is known 
to be more serious for CWFS than for SHWFS. 
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Soon after the contrast reaches a minimum, the 
defocus distance becomes too large (the sine in 
equation 35 becomes close to zero) and no reliable 
correction can be applied to the wavefront. Theo- 
retically, correction of higher spatial frequencies is 
possible as the sine in equation 35 periodically os- 
cillates between 1 and -1, but this possibility was 
not considered in figure 4: in a real CWFS, spec- 
tral bandwidth and time evolution of Sz (usually 
closer to a sine wave than a step function) prevent 
this feature from being usable. 

At small angular separations, the wavefront in- 
tegration time th for a CWFS needs to be pro- 
portionnal to a~ 7 / 9 to achieve the result shown 
in figure 4. This suggests that a curvature WFS 
can greatly benefit from a modal control scheme, 
where the correction speed can be adjusted for 
each spatial frequency, as opposed to a zonal re- 
construction with a fixed integration time. 

Although the optimal constrast region is nar- 
rower in a CWFS than it is with a SHWFS, is also 
is deeper: at a given separation, a properly tuned 
CWFS performs better than a SHWFS. This is 
especially true close to the PSF center, where a 
"tuned" CWFS can reach a sensitivity (3 P = 1. 
Since changing the extrafocal distance in a CWFS 
is usually very easy, it is in fact possible to contin- 
uously move the optimal contrast region between 
small and large angular separations during an ob- 
servation. The equivalent technique would be op- 
tically more complex in a SHWFS, as the subaper- 
tures size would need to be modified. 

4.3. Pyramid WFS 

The pyramid WFS (Ragazzoni 1996) divides 
the focal plane in 4 quadrants, each one bee- 
ing then reimaged in a separate pupil plane. In 
the geometrical optics approximation, wavefront 
slopes can be measured as contrast between pairs 
of pupil images. The focal plane point which de- 
fines the position of the quadrants (the "center" 
of the pyramid) can be rapidly rotating around 
the PSF core to increase linearity and dynamical 
range, at the expense of sensitivity. 

I denote Pi(x, y) the pupil image corresponding 
to quadrant i, as shown in figure 5, and P re f(x, y) 
the pupil image in the absence of a focal plane 
pyramid. The pyramid WFS can be operated in 2 
ways: 



Configuration A 



Configuration B 



Configuration C 
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Fig. 5. — Focal plane images (top) and corre- 
sponding pupil images Pi (bottom) for a sine- wave 
pupil phase error (corresponding to 2 symmetric 
speckles in the focal plane). See text for details. 

• Fixed pyramid position. The top of the 

pyramid (junction point between the 4 quad- 
rants) is at the center of the PSF core, cor- 
responding to configuration A in figure 5. 

• Modulation of pyramid position. The 

top of the pyramid is moving on a circle of 
radius r p . Configurations B (the central PSF 
core and one speckle are within the same 
quadrant) or C (the quadrant containing the 
PSF core either contains no speckle or both 
symmetric speckles) can then occur. 

4-3.1. Fixed pyramid position 

I denote P® the pupil image corresponding to 
quadrant i obtained in configuration A in the ab- 
sence of phase aberrations for a unobstructed cir- 
cular pupil. Ff consists of a fainter pupil image 
with bright sharp edges and a significant fraction 
of the light diffracted outside the geometric pupil. 
Although the total light in each pupil image P° 
is one quarter of the original pupil image, the to- 
tal light within the geometric pupil (excluding the 
bright edges) is about 6% of the original pupil. I 
consider a pupil complex amplitude 

W(u,0) = l + i— sin (W/ + (36) 

corresponding to a set of 2 symmetric speckles as 
shown in figure 5. In configuration A, these 2 
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speckles interfere with Pf in quadrants 1 and 3: 



Pi 



Pi 



in ^2iruf 



P2 = P 2 ° 

= P°-^pOp re/X ^sin 

p 4 = p? 



(2ttu/ 



0) (37) 
(38) 

0) (39) 
(40) 



The pyramid WFS therefore directly measures 
the pupil phase (Verinaud 2004; Verinaud et al. 
2005): this is somewhat different from the geo- 
metrical optics understanding of this concept, in 
which phase slope is measured by the pupil im- 
ages. Due to the splitting of the focal plane into 
4 zones, reconstruction of the full wavefront map 
requires Pi, P 2 , P3 and P4. For example, pupil 
images Pi and P3 are only sensitive to pupil phase 
spatial frequencies corresponding to zones 1 and 3 
of the pyramid. 

Using the method detailed in appendix A, 



p = y/2. 



4-3.2. Modulation of pyramid position 



(41) 



I consider a motion of the pyramid center on a 
circle of radius r p > X/d, with no change in the 
orientation of the pyramid. As shown in figure 
5, configurations B and C occur as the pyramid 
moves. In configuration B on figure 5, within the 
geometric pupil: 



Pi 



ref 



1 



2irA 



sin 



(2™/ 



P2 = Ps = Pi = 



(42) 



(43) 



In configuration C, Pi = P re f for the pyramid zone 
containing the PSF core, and Pi = for the other 
pupil images. In a long exposure (longer than the 
modulation time of the pyramid position) : 



P = P 

1 7. — 1 1 



ref 



2ttA 



sin 



(2™/ + ^ 



c 



where ff and ff are the fraction of the time dur- 
ing which the PSF core is in the pyramid quadrant 
i and the configuration is B and C respectively. 
The sign of the modulated intensity signal is op- 
posite between Pi and P3, and between P 2 and 



P 4 . Since ff + ff = 0.25 (the PSF core spends a 
quarter of its time on each zone of the pyramid) , 
in a long exposure (longer than the modulation 
time of the pyramid position) : 



P, 



ref 



1±4// 



.2irA 



sin 



(2™/ + ^ . 



(45) 

Each of the 4 pupil images contains the intensity 
modulation, but with different signal levels. Since 
fi = ff and f® = fi ' , using the method detailed 
in appendix A, 



2V2 



(4/f) 2 



(46) 



Figure 6 shows how (3 p varies across the focal 
plane. It is minimum at the pyramid modulation 
radius and increases rapidly toward the center of 
the PSF. 




Angular separation 



Fig. 6. — Value of f3 p in a modulated pyramid 
WFS. A 2D map of (3 P is shown on the left, and a 
averaged radial profile is plotted on the right. 



4-3.3. Discussion 

Figure 7 shows the contrast component C 2 for 
modulated and fixed pyramid WFSs. The sensi- 
tivity of the pyramid WFS is better if the pyramid 
is fixed, and this mode of operation should be pre- 
ferred in high-contrast AO on bright sources, as 
the linearity range of the WFS is then not a con- 
cern. If the pyramid is fixed, it may be replaced by 
a "roof top" (a pyramid with only 2 faces), which 
would offer the same sensitivity with 2 pupil im- 
ages instead of 4. 

4.4. Mach-Zehnder pupil plane interfer- 
ometer 

In this wavefront sensing scheme suggested by 
Angel (1994), a beam splitter produces two copies 
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8m telescope, PYRWFS, mv=5 star 




separation (arcsecond) 

Fig. 7. — PSF contrast component C2 obtained 
with a pyramid WFS with and without modula- 
tion of the pyramid's position. C2 is plotted for 
modulation radii ranging from 0'.'2 to 2". The top 
curve shows the PSF contrast Co corresponding to 
uncorrected turbulence phase aberration. Param- 
eters used for this simulation are listed in table 
4. 



of the same wavefront. One of the copies is 
spatially filtered and interferometrically combined 
with the unfiltered wavefront. Wavefront phase is 
transformed into intensity variations in the 2 pupil 
images produced by the interferometer. Minimum 
sensitivity is reached when the first beam splitter 
is symmetric (50/50 beam splitter), and the inter- 
ferometer's OPD is such that the 2 pupil images 
have the same brightness when the wavefront is 
perfect. The interferometer's OPD may be modu- 
lated to increase dynamical range. 

The WFS measures intensities Ik in the two 
pupil plane. I assume here that 27V such measure- 
ments are taken per spatial period (N measure- 
ments per pupil image): 



h = 



Nph 
2N 



1 ± sin 



2tt A 
A 



/2tt£; 



/ (47) 

with <7i k = y / N p f l /2N, and the sign in front of the 
sine is different for each pupil image. 

Using the method detailed in appendix A, the 
following expression for (3 P is obtained: 



Figure 8 shows the PSF contrast component C2 
obtained with a PPMZWFS. 

4.5. Focal plane WFS 

In a FPWFS, the amplitude and phase of fo- 
cal plane speckles are measured by inducing inter- 
ferences between the focal plane complex ampli- 
tude and a set of known "reference waves" (An- 
gel 2003). Optical configurations to produce the 
reference waves and measure the interferences are 
discussed in §6. 

The amplitude and phase of a focal plane 
speckle created by the sine- wave pupil phase error 
are 

(49) 
(50) 

to which correspond the real and imaginary parts 
of the speckle 




x s = A s cos(0 s ) = 



Vs 



A s sin(</> s ) 




(51) 



(52) 



where xo and yo follow the notations used in ap- 
pendix A. x and y are estimated through the 
measurement of N intensities: 

I k = { ^= +Xk f + { ^= +Vk f (53) 

where k = 0...N—1, and Xk, yk are the N reference 
waves with which the speckle light interferes. The 
total number of photons in the speckle is equally 
shared between the N measurements: x s and y s 
are therefore divided by \/N . 



dxo 

dh 
dyo 



N 



ph 



N 



(54) 
(55) 



& = 2. 



(48) 



Since the measurement noise on Ik is \J Xk 2 + yk 2 
(photon noise), the estimate of x s and y s from the 
measurement of Ik is insensitive to the amplitude 
of the reference wave (but not its phase). For ex- 
ample, multiplying the light level in the reference 
wave by 4 will double and i^-, and will also 

double the measurement noise \J Xk 1 + yk 2 - The 
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problem is therefore reduced to finding a set of 
phases for the reference waves. 

Numerical simulations using the method de- 
tailed in appendix A show that a minimum of 2 
reference waves are needed. The optimal perfor- 
mance (minimum value of S) is reached when the 
2 waves are offset by tt/2: 

P P = 2. (56) 

Increasing the number of waves does not lead to 
better solutions. If the wavefront does not contain 
amplitude variations (no scintillation), the sym- 
metry property of the focal plane speckles allows 
the use of 2 speckles to sense a single pupil plane 
spatial frequency, in which case: 

0p = y/2. (57) 

Figure 8 shows the PSF contrast component C2 
obtained with a FPWFS. 

8m telescope, mv=5 star 
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Fig. 8. — PSF contrast components C2 obtained 
with PPMZWFS, FPWFS and ZWFS. The top 
curve shows the PSF contrast Co corresponding 
to uncorrected turbulence phase aberration. Pa- 
rameters used for this simulation are listed in table 
4. 

4.6. Sensitivity of an "Ideal" WFS 

Using the (U, S c ) representation of WFSs given 
in appendix B, the unitary matrix U can be opti- 
mized for the wavefront sensing of a pure sine wave 
phase error of fixed frequency / by finding the 
smallest possible value of (i p . Random WFSs can 



be built and the corresponding value of (3 P com- 
puted using the equations detailed in appendix A. 
The quantities Ik measured are light intensities 
(number of photons), and the associated noise is 
y/Ik. The numerical simulation results show that 
the value of /3 P obtained is then independant of the 
size of the unitary matrix used to represent WFSs 
(as long as this matrix is larger than 3x3) : 

Pr = 1. (58) 

Theoretically, an "optimal" WFS should there- 
fore be able to have a sensitivity to photon noise 
(3 p = 1. This result does not however insure that 
a WFS which can satisfy this requirement simul- 
taneously for all spatial frequencies exists, as the 
above simulation was performed for a single spa- 
tial frequency. The results obtained in this work 
show that only the CWFS reaches this optimal 
sensitivity, but only for a single value of the spa- 
tial frequency /. Understanding how the CWFS 
achieves this result might allow the design of an 
"Ideal" WFS. 

When detection is performed in the pupil plane, 
the goal of the WFS is to transform a phase aber- 
ration into a light modulation. I now consider two 
symmetric focal plane speckles of amplitude a (rel- 
ative to the central peak amplitude) and phases 4>\ 
and 4>2 (relative to the phase of the central peak). 
Fourier transform of this speckle pair yields pupil 
plane modulation amplitudes of 

M a = 2acos(^±^} (59) 

for amplitude and 

Mp = 2as\n(^±^ (60) 

for phase. As described by equation 3, a pupil 
plane sine-wave phase aberration of amplitude tp 
(in radian) and phase 9 produces two focal plane 
symmetric speckles of amplitude ip/2 and phases 
4>i = tt/2 — 9 and <p 2 = tt/2 + 9. Equations 59 and 
60 confirm that, if the phase and amplitude are left 
unchanged in the focal plane, these 2 speckles cor- 
respond to a pure phase error in the pupil plane 
(M a = 0). At its optimal spatial frequency, the 
CWFS adds it/ 2 to the phase of each speckle, re- 
sulting in M a — 2a and M p = 0. In this particular 
case, the 2 speckles are interfering constructively 
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together in the pupil plane. This level of ampli- 
tude modulation is impossible to reach if the 2 
speckles are optically separated (Pyramid WFSs, 
FPWFS) and explains why only the CWFS can 
be "optimal" {[3 P = 1) for a spatial frequency. Its 
only drawback is that the pupil plane phase offset 
is 7r/2 only at a one value of the angular separa- 
tion. 

In order to build an "optimal" WFS, the focal 
plane offset would need to be 7r/2 at all separa- 
tions. The most practical solution is to change 
the phase of the PSF core by 7r/2 and — 7r/2 al- 
ternatively as shown in figure 9. In closed- loop 
operation in an AO system, the phase offset does 
not need to be achromatic, but should be approx- 
imately ±7r/2 for minimum sensitivity (the phase 
offset in a CWFS is also not achromatic). This 
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Fig. 9. — Schematic representation of the Zernike 
WFS (ZWFS), the WFS with the minimum sen- 
sitivity to photon noise. 

WFS was originally devcloppcd for microscopy 
(Zernike 1934), and is named Zernike Phase Con- 
trast Wavefront Sensor (ZWFS) in this paper. 
More recently, this WFS has been suggested for 
ground-based AO systems (Bloemhof & Wallace 
2003), because it offers a direct phase measure- 
ment (as opposed to wavefront slope for SHWFS). 
The phase mask should be small (size w X/d) if 
the contrast needs to be optimized very close the 
PSF core. In broadband, a speckle might be out- 
side the phase mask in the red end of the band 
but inside it in the blue end. The sensitivity f3 p 
is then more than 1 in an intermediate transition 
region between the inner part of the PSF (/3 is in- 
finite) and the outer part of the PSF (/3 p = 1). 
A Wynne corrector (Wynne 1979; Roddier et al. 



1980), which magnifies the pupil by a factor propo- 
tional to wavelength, may be used to avoid this 
effect. This device was originally developed to in- 
crease the spectral bandwidth of speckle interfer- 
ometry, and has been successfully used on the sky 
(Boccaletti et al. 1998). 

The ZWFS is highly sensitive (/3 P = 1, every- 
where except possibly in the central core of the 
PSF if the mask is large) and quite achromatic, 
but has limited dynamical range: it is ideal when 
used after a low-order first stage AO system. 

Figure 8 shows the PSF contrast component C*2 
obtained with a ZWFS. 

4.7. Discussion 

Table 3 summarizes the results obtained previ- 
ously and lists (3 P and (3 a for the 7 WFSs compared 
in this study. 

4-7.1. Sensitivity for OPD measurement 
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Fig. 10. — Value of /3 P as a function of angular 
separation for the WFSs compared in this study. 
The WFSs were optimized for a separation of 0'.'5. 
For the SHWFS, r = 0.2m and A = 0.5^m. 

For some WFSs (SHWFS, CWFS and MPYR- 
WFS) (i p reaches its minimum at a given distance 
from the optical axis, and increases closer to the 
PSF core: these WFSs suffer from the noise prop- 
agation effect (low sensitivity to low-order modes 
due to photon noise). The other WFSs (FPYR- 
WFS, PPMZWFS and FPWFS) maintain a con- 
stant value of (3 P at all separations: noise propaga- 
tion is low, and low-order terms can be corrected 
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Table 3 
Comparative table of WFSs 



SHWFS CWFS FPYRWFS MPYRWFS PPMZWFS FPWFS ZWFS 



0v 


> 2 


> 1 




> 2 


2 


v5 


1 


|9o 


1 


/ 1 
V l-l//3p a 


V2 


> 1 


1 


V2 




Optimal scp. set by # lcnslcts defocus - modul. rad. 


Noiso propagation 


high 


high 


low 


high 


low 


low 


low 


Achromaticity 


good 


good 


good 




good^ 


good- 1 


good 


Aliasing 


high 


high 








none 




Solutions tu aliasing 13 


SF 


DS,SF,OF 


DS,SF,OF 


DS,SF,OF 


DS,SF,OF 




DS,SF,OF 


Dynamical range 


high 


high 


low 


high 


low 




low 


Elements 


lenslcts 


pixels 


pixels 


pixels 


pixels 


pixels 


pixels 


Detectors per element 


> 4 


1 


4 


4 


2 


> l d 


1 



a With the use of a Wynne corrector. 

^Aliasing can be reduced by higher detector sampling (DS), spatial filtering in the focal plane (SF). or an optical anti-aliasing 
filter (OF). 

c R.equircs achromatic phase shifts. 
^Set by focal plane pixel scale. 
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Fig. 11. — Value of th, the optimal sampling, as a 
function of angular separation for the WFSs com- 
pared in this study (m v — 5 source). 



efficiently at the same time as high-order terms. 
For most WFSs, correcting for amplitude+phasc 
instead of only phase increases (3 P , with the ex- 
ception of the SHWFS, which measures amplitude 
"for free" (photometry within each subaperture). 
As will be shown in §2, in the photon-noise limited 
regime, the PSF contrast achieveable by an AO 
system varies as (3 p 4 ^ 3 (equation 20): the differ- 
ences shown in figure 10 are therefore important. 
For example, the contrast can be 1.6 times better 
in a CWFS-base or ZWFS-based AO system than 
in a FPWFS-bascd or FPYRWFS-based system. 
The "ideal" ZWFS outperforms the PPMZWFS 
by one magnitude in contrast. In the example 



shown in figure 10, the SHWFS, even perfectly 
tuned for optimal PSF contrast at 0'.'5, produces 
a PSF contrast 4.3 worse than a ZWFS. As shown 
in figure 11, the sampling frequency required to 
reach optimal performance on a bright source is es- 
pecially high at low spatial frequency for the most 
efficient WFSs. 

Non-common path errors can limit the achiev- 
able contrast, and are almost unavoidable in pupil- 
plane WFSs (all WFSs except for FPWFS). Very 
accurate calibration is then required, and can be 
obtained by focal plane phase diversity. The FP- 
WFS is immune to this effect if the wavefront sens- 
ing and scientific focal planes are shared, which 
is likely to be the case for visible coronagraphic 
imaging of extrasolar planets from space (TPF 
mission). However, on ground-based AO sys- 
tems, wavefront sensing in the visible and scien- 
tific imaging in the near-IR is often prefered for 
scientific and technological (detectors) reasons. 

All WFSs studied in this paper have good 
achromaticity and can be used in broadband light. 
The SHWFS, CWFS and FPYRWFS are natu- 
rally achromatic, while other WFSs require ei- 
ther achromatic phase shifters (PPMZWFS and 
ZWFS) or a Wynne corrector (FPWFS). 

4-7.2. Sensitivity for scintillation measurement 

The steps to compute (3 a are not detailed in 
this work, but comparison with the computation 
of j3p reveals that (3 a = 1 if all the light is used to 
image the pupil. From this result, (3 a can be easily 
estimated for all WFSs considered in this study. 
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In this work, I choose to adopt (3 a = 1 for all 
WFSs in subsequent numerical simulations. While 
this is exact for the SHWFS, which does not opti- 
cally modify the light intensity in the pupil plane, 
this is not true for most other WFSs. For exam- 
ple, in the CWFS, l//3 a 2 + l//3 p 2 = 1 (equations 
35, 59 and 60): at the optimal angular separation 
(defined by [3 p = 1) the CWFS is insensitive to 
scintillation {fi a = oo). If [3 a is high and C3 » C2, 
then a fraction of the total flux (or, equivalently, 
time) needs to be allocated to scintillation sensing, 
which is performed most efficiently by imaging of 
the pupil. For example, in the CWFS, a fraction 
of the time is spent at dz = (no defocus in the 
focal plane). This sharing of the photons increases 
C2 and decreases C3 until C2 + C3 is minimal. 

However, as shown in §5, C\ < C within the 
central arcsecond: OPD aberrations are stronger 
than scintillation at low spatial frequencies. Since 
both terms arc moving in front of the telescope 
with the same speed v, the post-correction scin- 
tillation residual C3 can be made comparable to 
post-correction OPD residual C2 by allocating a 
small fraction of the incoming photons to scintil- 
lation measurement. The PSF contrasts obtained 
with the approximation (3 a = 1 are therefore only 
slightly optimistic within the central arcsecond: 
(3 P sets the value of C2 + C3, not fi a . 

Beyond a = 1", however, Co ~ Ci, and if 
a WFS is characterized by fi a — 00, (3 P = 1, 
half of the photons should be allocated to pure 
scintillation measurement. This would result in 
Pa = Pap = V2, which would produce contrasts 
C2 and C3 equal to 2 2 / 3 s=s 1.6 the values obtained 
with the optimistic approximation (5 a = 1. The 
maximum error made by the (5 a = 1 approxima- 
tion is therefore a factor 1.6 on C2 and C3, and 
can only occur at large angular separation (a >1") 
with the CWFS and the ZWFS. 

5. Contrast performance 

5.1. Parameters adopted for numerical 
simulations 

Table 4 lists the default parameters adopted in 
this work for numerical simulations. The atmo- 
spheric parameters correspond to conditions fre- 
quently encountered atop Mauna Kea, Hawaii. 
The weigths and altitudes of the turbulence lay- 
ers are derived from 4 nights of MASS and Sci- 



dar measurement atop Mauna Kea (Tokovinin et 
al. 2005). The photometric zero point of the 
WFS (corresponding to an equivalent bandpass of 
O.l^m) is representative of existing WFSs. 

Through the paper, some of these parameters 
are modified to evaluate the contrast performance 
of a system which departs from this default con- 
figuration: wavefront sensing and imaging wave- 
length in §5.3 and atmospheric parameters in §5.4. 
The contrast performance can also easily be de- 
rived for telescopes larger than 8m: since the con- 
trast limits Co to Cq are all proportionnal to l/D 2 , 
the overall contrast for all WFSs is proportionnal 
to l/D 2 . 

5.2. Relative contribution of contrast lim- 
its components in conventional AO. 
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Fig. 12. — Contrast limits imposed by the uncor- 
rected atmospheric turbulence (CO and Cl), cor- 
rected atmospheric turbulence (C2 and C3), chro- 
matic effects (C4, C5, and C6) for a 8m telescope 
and a m v = 5 source. See text for details. 

Figure 12 shows the relative contributions of 
Co, Ci, C2, C3, C4, C5 and Cq when an ideal 
WFS (ZWFS) is used on a bright (m v = 5) star. 
According to the results obtained in §4, this wave- 
front sensing scheme is the most sensitive, and 
other WFSs will show higher values of C2. The 
main parameters of the simulation are listed in ta- 
ble 4, and are used through this work unless other- 
wise specified. Chromatic effects introduced by ei- 
ther the refraction index of air (component C§) or 
Fresnel propagation through the atmosphere (C4 
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Table 4 

Default simulation parameters 



Telescope diameter 


D 


8 m 


Detection wavelength 


Ai 


1.6 /Km 


Source brightness 


m v 


5 


WFS wavelength 


A 


0.55 /Km 


WFS bandpass 




0.1 /itm 


WFS quantum efficiency 




1.0 


Wl S zero point (counts tor m v = source) 




9.74 10 cnt/s/m 


WFS sensitivity to OPD 


P P 


1.0 


WFS sensitivity to scintillation 


Pa 


1.0 


Fried parameter 


ro at 0.5 fj,m 


0.2 m (0.5" seeing) 


Wind speed 


V 


10 m/s 


Number of layers 




6 


Layer 1 altitude / C 2 fraction 




0.5km / 0.2283 


Layer 2 altitude / C 2 fraction 




1.0km / 0.0883 


Layer 3 altitude / C 2 fraction 




2.0km / 0.0666 


Layer 4 altitude / C 2 fraction 




4.0km / 0.1458 


Layer 5 altitude / C 2 fraction 




8.0km / 0.3350 


Layer 6 altitude / C n fraction 




16.0km / 0.1350 


effective altitude (^/no) 1 ^ 2 


22 


7.7km 


effective altitude (/X5/3/ /io) 3 ^ 5 


h 


7.1km 



and C5) can have a strong impact on the PSF con- 
trast. One very important result from this study 
is that Co, Ci, C 4 and C5 are all comparable be- 
yond 2". Similarly, C2 and C3 are comparable be- 
yond about 2". The implications of this result are 
now discussed separately for AO systems correct- 
ing only OPD and AO systems correcting OPD 
and scintillation. For now, I choose to limit this 
discussion to AO systems performing wavefront 
sensing in the visible and imaging in the near-IR, 
as choices of wavelengths will be discussed in §5.3. 

• OPD correction with AO : In an OPD- 

only AO correction (C = C x +C 2 +C 4 + C 6 ), 
the uncorrected scintillation C\ dominates 
by far the achievable PSF contrast within 
the central 2", and limits it to approximately 
10 -4 to 2 10~ 4 within the central arcsec- 
ond. The term due to photon noise and 
time lag, C2 is much lower, at about 1CP 7 
for this bright (m v — 5) source. The ef- 
fect of photon noise would become dominant 
only for my > 13 with a high-sensitivity 
WFS {(3 P = 1). The OPD chromaticity term 
C4 due to Fresnel propagation is small close 
to the PSF center, but is rapidly increasing 
with angular separation, and is comparable 



to scintillation C\ at 2"and beyond. 

• OPD+scintillation correction with AO 

: With an AO system correcting both OPD 
and scintillation (C = Ci + C3 + C4 + C5 + 
Cg), the term C5 due to the chromaticity 
of scintillation limits the PSF contrast to 
slightly better than 1CP 4 in the central 2". 
Scintillation, if measured at A ss 0.5/zm, can- 
not be well corrected for at Ai w 1.6/Ltm. The 
improvement to the PSF contrast brought by 
correction of scintillation with the AO sys- 
tem is quite modest (about a factor of 2). 

Beyond about 2", C w C , and "classical" AO 
(wavefront sensing in visible, imaging in near-IR), 
even with scintillation correction, cannot improve 
the PSF contrast: there is no use to increase the 
number of elements beyond this limit. 

5.3. Choice of wavefront sensing and imag- 
ing wavelengths 

The number of photons available for wavefront 
sensing is a function of spectral type, and should 
be maximized to reduce the PSF contrast Ci. In 
this section, I consider the number of photons 
available for wavefront sensing to be independant 
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of A, which is a good approximation for a spec- 
tral type G2 and a fixed spectral bandwidth (5X/X 
constant). 

Figure 13 illustrates how A and A^ affect PSF 
contrast components Co to C%. When A = Aj, 
chromatic terms (C4, C5 and C 6 ) disappear, and 
the PSF contrast is driven by WFS photon noise 
through C 2 and C 3 (for an AO system correcting 
OPD and scintillation) or C\ (for an AO system 
correcting only OPD). In all configurations, Cq has 
a negligible impact on PSF contrast beyond 0'.'3. 




angular separation (") 



£ 1e-05 



WFS operating at 1 .60 urn 

1.25 um 




1.65 um 2.2 urn 



1.6 |lm 



angular separation (") 

Fig. 14. — PSF contrast for a WFS operating at 
0.55fim (top) and 1.6/zto (bottom) after AO cor- 
rection of OPD and scintillation. The PSF con- 
trast is shown at a function of angular separation 
at imaging wavelengths ranging from 0.55^to to 
2.2fim in each case. 

These results are combined in figure 14, which 
shows the achievable PSF contrast as a func- 



tion of imaging wavelength when the WFS wave- 
length is fixed. An AO system correcting both 
OPD and scintillation is considered in this fig- 
ure, with an "ideal" WFS operating in the vis- 
ible (top) or in the ncar-IR (bottom). When 
A = Xi, the photon-noise driven PSF contrast is 
between 10 -6 and 1CP 7 in the central arcsecond 
in both cases. The PSF contrast degrades very 
rapidly as Xi becomes different from A. In con- 
ventional AO, where A ~ 0.55/im and imaging is 
performed in the near-IR, the PSF contrast, dom- 
inated by chromatic effects, is limited to a few 
times 10~ 5 in the central arcsecond. In order to 
approach the photon-noise limit (C 2 + C3), imag- 
ing needs to be performed within about 0.05yum 
of the wavefront sensing wavelength. When ob- 
serving a bright source, figure 14 illustrates that 
the wavefront sensing wavlength should be chosen 
equal to the imaging wavelength. 

This statement is at variance with the com- 
mon practice of combining visible wavefront sens- 
ing and near-IR imaging to yield the best possible 
contrast. Figure 13 does indeed show that this is 
the optimal choice if chromatic effects are ignored. 
In current AO systems, the WFS detector and sen- 
sitivity (3 P are sub-optimal (the perfect example is 
a SHWFS with finite readout noise CCD), and er- 
rors are dominated by photon noise except for the 
brightest sources: in this regime, figure 13 shows 
that visible A and near-IR Xi is optimal. More- 
over, finite number of actuators, lack of aliasing 
mitigation scheme and non-common path errors 
set a limit to the PSF contrast even for bright 
sources: the effect of these OPD errors on the 
PSF contrast is mitigated by increasing Xi. For 
these reasons, most current AO systems do not 
achieve a 10~ 4 PSF contrast in the central arc- 
second, and are therefore not dominated by chro- 
matic effects. However, an AO system designed 
to maximize PSF contrast (using an efficient WFS 
and an aliasing mitigation scheme) would be dom- 
inated by chromatic effects if A ^ Xi . 

If a visible WFS is used for imaging in the near- 
IR, the chromatic components C4 and C5 are fixed 
phase and amplitude screens moving in front of the 
telescope's pupil with speed v. The coherence time 
of the corresponding speckles is therefore long, un- 
like the fast residual speckles due to time lag and 
photon noise (contributions Co and C\). These 
slow speckles are very detrimental to the final de- 
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Fig. 13. — PSF contrast components Co to C% as a function of angular separation for different combinations 
of WFS wavelength A (shown at the top, in micron) and imaging wavelength Aj (shown on the left, in 
micron) . 



tection limit, as they require long exposure times 
to average into a smooth continuous background. 

5.4. Observing site 

If A 7^ Xi (contrast dominated by chromaticity 
effect), the PSF contrast C is dominated by C5 
as illustrated in figures 13 and 14. Equation 26 
for small angular separations (within the central 
arcsecond) leads to: 



C oc z 2 2 r 



-5/3 



where 



2 V fC*{z)dz ■ 



(61) 



(62) 



The contrast is then independant of wind speed v. 
In this regime, multiplying z 2 by 0.6 (40% lower 



altitude turbulence) is equivalent to improving the 
seeing by a factor 2. The high importance of tur- 
bulence heigth on PSF contrast is illustrated in 
figure 15, where the turbulence profile given in ta- 
ble 4 has been scaled in altitude to modify zi- All 
other parameters of for this simulation are given 
by table 4. 

If A w A, (chromaticity effects are small), the 
PSF contrast is dominated by C2, as shown in fig- 
ure 13. At small angular separations, 



C: 



ro -5/9 w 2/3_ 



(63) 



The contrast is then independant of the atmo- 
spheric turbulence profile. 

Figure 16 shows the PSF contrast for the ob- 
serving conditions listed in table 5 for A = Aj = 
0.85 [im, and for A = 0.55 fim, A, = 1.6/xm. Pa- 
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Table 5 

Observing sites characteristics 



Site 


Seeing 


ro 


TO 


V 


#0 


Z2 




(") 


(m) 


(ms) 


(m/s) 


(") 


(km) 


This work 


0.5 


0.2 


6.3 


10 


1.5 


7.7 


A 


0.58 


0.17 


2.7 


19.8 


1.9 


6.3 


B 


0.35 


0.29 


5.4 


16.9 


3.8 


5.4 


C 


0.27 


0.37 


7.9 


14.7 


5.7 


4.6 


D 


0.10 


1.0 


20 


15.7 


8.5 


8.3 


E 


0.74 


0.14 


3.3 


13.3 


2.6 


3.8 


F 


0.50 


0.2 


6.9 


9.1 


3.6 


3.9 



0.001 



1e-04 



Oh 1e-06 



1e-07 
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Fig. 15. — Effect of the effective turbulence alti- 
tude zi on the PSF contrast. Parameters other 
than Z2 for this simulations are given in table 4. 

ramcters not listed in table 5, such as telescope 
diameter, were taken from table 4. Atmospheric 
conditions A and B in table 5 were derived from 
measurements atop Mauna Kea (Subaru Telescope 
2004; Racine & Ellerbroek 1995), where A is rep- 
resentative of median conditions, while B corre- 
sponds to excellent conditions (roughly 10th per- 
centile). Models C and D corresponds to re- 
spectively median and 10th percentile conditions 
measured by Lawrence et al. (2004) at Dome-C, 
Antartica. Since these measurements were ac- 
quired over a short timescale (22 nights within a 
50 days period), and were insensitive to low alti- 
tude (less than 30m) turbulence, they might not 
accurately represent the site's atmospheric con- 
ditions. Models E and F are representative of 
median and 10th percentile conditions on Cerro 



Parannal (ESO 2004). 

The atmospheric turbulence outer scale Lq, 
which was not included in the model used in this 
study, is usually larger than 10m, and has there- 
fore no direct effect on the PSF contrast beyond 
0'.'05 in the ncar-IR. 

6. The case for focal plane wavefront sens- 
ing 

The results obtained in §5 suggest that wave- 
front sensing and imaging should be performed at 
the same wavelength. The PSF contrast is then 
driven by the sensitivity of the WFS, and can 
reach up to 10~ 7 in visible/ncar-IR on a 8m tele- 
scope in the central arcsecond for a bright source. 
To achieve this level of contrast, a high sensitiv- 
ity WFS should be chosen, and non-common path 
errors need to be carefully calibrated. 

The FPWFS offers high sensitivity (only sur- 
passed by the ZWFS) and does not suffer from 
aliasing or non common path errors (if the same 
focal plane is used for scientific imaging and wave- 
front sensing). If amplitude and OPD errors 
are comparable and both need to be measured, 
FPWFS is in fact as sensitive as the "ideal" 
ZWFS. Focal plane wavefront sensing is therefore 
an extremely attractive solution for high contrast 
ground-based AO, and I show in this section how 
it could be implemented. 

In order to estimate the complex amplitude in 
the focal plane, a set of N reference waves are com- 
bined in the focal plane with the speckles. Fol- 
lowing the notation adopted in §4.5, the complex 
amplitude of the reference wave k is noted {xk,yk) 
(real and imaginary parts) and is a function of the 
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Fig. 16. — PSF contrast for different observing 
sites. On the top panel, A = Aj = 0.85/um. On 
the bottom panel, A = 0.55 fim and Aj = 1.6/um. 
Parameters other than r , v and Z2 for this simu- 
lations are given in table 4. 

position in the focal plane. As pointed out in §4.5, 
the amplitudes of the reference waves are not im- 
portant, provided that they are larger than the 
amplitude of the speckles that need to be sensed, 
and a minimum of 2 waves arc needed to reach op- 
timal performance. If only 2 waves arc used, their 
phases should be offset by ir/2 ideally As more 
reference waves are used, the number of nearly- 
optimal solutions increases and more flexibility in 
the choice of the reference waves is available. 

The reference waves can be produced by phase- 
shifting of light extracted from the central part of 
the PSF in the focal plane (Angel 2003; Codona & 
Angel 2004). The interference between the speckle 



cloud and the reference waves is obtained through 
a Mach-Zender type interferometer with a beam 
splitter. This approach allows to create the opti- 
mal reference wave for focal plane wavefront sens- 
ing but requires additionnal optics. 

A slightly less optimal, but optically simpler 
solution is to use the DM to produce the required 
reference waves. Each reference wave is created 
by a command sent on the DM: for example, mov- 
ing a single actuator produces a reference wave 
of quasi-constant amplitude within the control re- 
gion, and of phase given by the position of the 
actuator within the pupil. Provided that the influ- 
ence functions of the DM and the behaviour of the 
coronagraph are known, the complex amplitude of 
the reference wave in the focal plane can be com- 
puted to a good accuracy. In a fast closed-loop 
system, the accuracy with which these reference 
waves are known does not need to be very high 
(about 10% accuracy is sufficient) to be photon- 
noise limited. 

Although the full optimization of the set of ref- 
erence waves is beyond the scope of this work, I 
now show that reference waves well suited to focal 
plane wavefront sensing can be obtained with the 
DM : 

• Extent of reference wave in the focal 
plane. The size of a reference wave in the 
focal plane is set by the size of an actuator of 
the DM. It is therefore possible to produce 
reference waves with relatively high ampli- 
tude across all of the diffraction control do- 
main (DCD, the region in the focal plane for 
which the DM sampling is sufficient to sup- 
press diffracted light). 

• Ability to set the amplitude of the ref- 
erence wave. Small amplitude waves can 
easily be created by moving a single actua- 
tor. Increasing the displacement of this ac- 
tuator will increase the reference wave am- 
plitude up to l/N act of the peak amplitude of 
the PSF for an unapodized pupil, where N act 
is the total number of actuators in the DM. 
For a PSF contrast better than 10 -7 and a 
DM with more than 10 7 actuators, actua- 
tors need to be moved in groups to produce 
reference waves of sufficient amplitudes. 

• Ability to produce an achromatic 
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phase in the reference wave. A small 
positive displacement of the center actua- 
tor of the DM produces a reference wave 
of achromatic phase 7r/2 in the focal plane 
(— 7r/2 for a negative displacement). The 
amplitude of this reference wave is however 
chromatic, but this effect will not seriously 
affect the system performance in a closed- 
loop system, as it is equivalent to multiply- 
ing the signal (speckle intensity modulation) 
by a wavelength-dependent gain of constant 
sign. The phase of reference waves produced 
by moving actuators other that the central 
actuator are also achromatic, provided that 
a Wynne corrector is used. 

Actuators near the edge of the pupil should prefer- 
ably not be used to produce the focal plane ref- 
erence waves, as the resulting phase would vary 
rapidly across the focal plane image. 

7. Coronagraphy 

The contrast limits derived in §4 and §5 as- 
sumed that the only sources of scattered light are 
the wavefront errors (phase and amplitude) in the 
corrected beam. These results only apply to op- 
tical systems in which the static diffraction (Airy 
pattern on a circular aperture) is below the con- 
trast levels derived in this work. In this section, 
I discuss the validity of this approximation, and 
briefly summarize the options available (coronag- 
raphy) to insure that the limits derived in §4 and 
§5 can be reached. 

7.1. Need for suppression of Airy pattern 

Figure 17 shows that with an efficient WFS, 
diffraction associated with the Airy pattern is 
much stronger than light diffracted by residual 
wavefront errors at small angular separations. At 
some distance from the optical axis, both contri- 
butions are equal. Beyond this separation, coro- 
nagraphy is not required to reach the contrast 
level achievable by the AO system. As can be 
seen in figure 17, this critical angular separa- 
tion decreases as the telescope diameter increases. 
For reasonnable size telescopes (100m diameter or 
less), this critical separation is larger than 2", and 
suppression of the Airy pattern is therefore re- 
quired. While light diffracted by residual wave- 
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Fig. 17. — PSF contrast with an optimal WFS 
(/3 = 1) and Airy pattern diffraction for 8m, 15m, 
30m, 60m and 120m diameter telescopes. The 
black circles mark the points where the theoret- 
ical PSF contrast achievable with a perfect coron- 
agraph equals the Airy pattern diffraction contrast 
level. 

front errors is time-variable, and might not aver- 
age nicely with time, the Airy diffraction pattern 
on the other hand is very stable and can be cali- 
brated accurately. One might therefore wonder if 
relatively high levels of static diffraction features 
in the PSF are really detrimental to high con- 
trast imaging. Static diffraction actually amplifies 
the time- variable speckles, an effect referred to as 
"speckle pinning" (Blocmhof et al. 2001; Bloemhof 
2003; Aime & Soummcr 2004). This is due to the 
fact that the complex amplitudes of static and dy- 
namic speckles add in the focal plane, and the light 
intensity I measured is: 

1= {A s + A d f = I s + 2^IJ~ d + I d (64) 

where I s and I d are the static and dynamic focal 
plane intensities, and the phase term between the 
2 contribution has been ommited for simplicity. 
Assuming that I s is well known and can be per- 
fectly subtracted, if I s > I d (Airy patter is bighter 
than C), the "speckle noise" becomes dominated 
by 2\[Tj~ d and is therefore 2^/I s /I d stronger than 
it would be if I s = 0. In the example considered in 
figure 17, at 0'.'5 separation, adding a coronagraph 
reduces the "speckle noise" by factors 52, 26 and 
19 on 8m, 30m and 60m telescopes respectively. 
With focal plane wavefront sensing, the static 
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diffraction of the telescope pupil can be treated 
just as atmospheric speckles, and can therefore be 
perfectly cancelled in half of the focal plane with 
a single DM. The DM phase is then used to can- 
cel the static diffraction (Airy rings for example) 
created by the pupil intensity distribution, remov- 
ing the need for a coronagraph, or at least lower- 
ing the requirement on its intrinsic performance. 
Since the DM phase is chromatic, but the pupil 
intensity is not, the resulting dark region of the 
PSF is chromatic (Codona & Angel 2004) and does 
not allow the use of a wide spectral band. Yang 
& Kostinski (2004) however found that solutions 
with very low chromaticity exist with a square 
telescope pupil, and similar solutions might exist 
with circular telescope pupils. If successfull, this 
technique could be used instead of a coronagraph 
in narrow band imaging, and would be optically 
very simple, provided that the DM can produce 
the required phase functions: the solutions found 
by Yang and Kostinski require a large phase slope 
at the edge of the pupil. 

7.2. Coronagraph/ WFS combination 

At large angular separation the coronagraph 
does not need to attenuate the Airy pattern by 
a large factor (at most a factor 100 attenuation is 
required at 075 according to figure 17), and many 
suitable coronagraphic options are therefore avail- 
able. 

When observation at small angular separation 
(a few X/d) is required, the choice of the corona- 
graph is more critical. Coronagraphs with small 
IWA exist, but are often prone to: 

• (1) Sensitivity to tip-tilt and low order 
modes: This effect is problematic on coron- 
agraphs for which small (less than X/d) tip- 
tilt errors can scatter light at large outside 
the IWA (Roddier 1997; Rouan et al. 2000; 
Baudoz et al. 2000). The coronagraphic 
leaks are especially large if they are com- 
bined with a WFS having poor sensitivity 
to low-order modes, such as a CWFS or 
a high-order SHWFS. A FPWFS is prefer- 
able if the contrast at very small IWA needs 
to be optimized. Coronagraphs relying on 
pupil apodization manage to keep this effect 
small while offering small IWA (Kasdin et 
al. 2003; Guyon et al. 2005). 



• (2) Chromaticity: Coronagraphs with small 
focal plane masks are sensitive to the 
wavelength-dependent PSF scale. Several 
designs have been proposed to mitigate 
(Soummer et al. 2003a; Rouan et al. 2000) 
or solve (Baudoz et al. 2000) this problem. 

• (3) Reduced throughput due to pupil and/or 
focal plane masks (Kasdin et al. 2003; Kuch- 
ncr & Spergel 2003; Soummer et al. 2003), or 
splitting of light (Baudoz et al. 2000): Loss- 
less apodization can be used to avoid this 
problem (Guyon et al. 2005). 

• (4) Lower image quality: broader PSF due 
to apodization (Kasdin et al. 2003) or double 
images (Baudoz et al. 2000). 

These effects can be especially problematic if the 
WFS is placed after the coronagraph, as would 
likely be the case in a FPWFS-based system: 
chromaticity, reduced throughput and lower im- 
age quality would then effectively reduce the WFS 
signal-to-noise ratio and compromise the achiev- 
able contrast ratio. 

8. Conclusion 

A thourough comparison of the fundamental 
contrast limits of AO has shown that visible wave- 
front sensing doesn't allow high accuracy correc- 
tion of near-IR wavefront aberrations: chromatic 
effects then limit the PSF contrast to 10 4 to 10 5 
within the central arcsecond. Wavefront sensing 
should therefore be performed at the same wave- 
length as imaging to reach the contrast limit im- 
posed by photon noise (about 10~ 6 to 10~ 7 in the 
central arcsecond). 

An AO system optimized for high contrast, 
with wavefront sensing and imaging at the same 
wavelength, can still greatly benefit from a first 
stage AO correction with a shorter wavelength 
(visible) WFS: 

• If residual aberrations are small, a FPWFS 
can be used efficiently. This WFS offers 
unique advantages: no non-common path er- 
rors, no aliasing, high sensitivity. It there- 
fore appears to be the ideal solution for high 
contrast AO. 

• If wavefront correction were perfect at the 
shorter wavelength, the chromatic residu- 
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als that the second WFS needs to measure 
would be small and relatively slow (same 
speed as the uncorrected turbulence). The 
PSF contrast achievable in this case could be 
better than the limits derived in this work, 
as more photons are used. 

This solution is especially attractive since low- 
noise fast visible detectors exist, while near-IR de- 
tectors currently offer lower performance. Theo- 
retically, combining a fast high sensitivity visible 
WFS (preferably a ZWFS) with a slower near-IR 
WFS (preferably a FPWFS) is not as advanta- 
geous for red sources as it is for bluer (spectral 
type G or bluer) sources. 

The PSF contrast estimates derived in this pa- 
per represent a limit which is hard to reach, as 
many optimistic hypothesis have been made: ob- 
servation at zenith, perfect telescope and detec- 
tors, perfect DM, high system throughput, favor- 
able atmospheric conditions, perfect coronagraph, 
bright m v = 5 source, no time delay for AO con- 
trol. On a 8m telescope, PSF contrast up to 1CT 6 
may be reached in the central arcsecond if devi- 
ation from these optimistic assumptions is mini- 
mal. Even on a 100m telescope, the corresponding 
contrast (about 10~ 8 ) is two orders of magnitude 
short of what is required to detect an Earth-size 
planet orbitting a solar-type star. Direct imag- 
ing of extrasolar planets is therefore bound to rely 
heavily on efficient calibration of the speckle noise 
(through differential imaging techniques for exam- 
pie). 

The author is thankful to the referee, Rene 
Racine, for his detailed and thorough look at this 
work, which led to many useful suggestions. 
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A. Computing the sensitivity of a WFS to photon noise for the measurement of a pure sine 
wave phase error 

In this appendix, I define (3 p , the quantitative measure of sensitivity to photon noise of a WFS, and show 
how it can be derived. The steps described here were used for each of the WFSs considered in this study to 
compute (3 P for a range of spatial frequencies. 

The wavefront error given in equation 2 can be represented in a 2D plane coordinate system by 

xo = -7— cos(#) (Al) 
A 

y = ^sm(9) (A2) 

where xo and yo are both in radian. 

The WFS produces a set of N values Iq, h 1 ...In-i (usually light intensities, but can also be centroid 
positions for a SHWFS) which are used to compute x and y, the measured values of xq and yo- Because of 
photon noise, x ^ xq and y ^ y . After correction by the DM, the residual sine wave phase aberration at 
the spatial frequency considered has an amplitude A res (in radian): 

A res = ^/(x-x ) 2 + (y-y a ) 2 . (A3) 

In order to compute X, the residual phase error in the pupil plane in radians rms, the probability distri- 
bution P(x,y) of x and y can be used: 



S = J [ P(x, y)((x - x )2 + ( y - y )*)dxdy. (A4) 

P(x, y) can be written as 

N-l 

P(x,y) = [] Pk(x,y), (A5) 

fe=0 

where P}-{x,y) is the probability distribution of x and y given by the measurement Ik- 

I denote 07 fc 2 the variance on the measurement of for which a gaussian probability law is assumed. I 
also assume that Ik is a linear function of x and y around (xo,yo) (this is true in closed- loop AO systems, 
as Xo,yo, x and y are small) : 

I k (x,y) = I k (xo,yo) + ^X + ^Y (A6) 

where the derivatives are computed in (x ,yo), X = x — x and Y = y — y - 

In the (x,y) plane, a measurement Ik gives a probability Pk only along one axis (the normalization 
coefficients of probability distributions are omitted in this section): 

P k {x,y)=e^- {KX - >r Y ? (AT) 



2a fc 2 



where k x 2 + k v 2 = 1, 



and 



kx_ _ dl k f djk 
k y dx \dy 



(A8) 



<"■ = <"* \ hrr + hrr ■ ( A9 ) 
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The vector (k x ,k y ) gives the direction along which Ik is "sensitive". For example, if dlk/dx = d Ik/dy, 
(k x ,ky) = (1/V2, 1/V2), and P(x, y) is only a function of X + Y. In this example, the probability is constant 
along a line x — est — y, as the 2 partial derivatives of Ik cancel when moving along this line. 
From equation A5, 

' X 2 XY 



where 



P(x, y) = exp - 



1 
1 

"3 



«2 



N k 2 



fe=0 
N 



2^ 



k x ky 



N 



k 2 

k 2 °* 



a 3 



(A10) 

(All) 
(A12) 
(A13) 



Y = 

y-y 
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* y' 














1 4 












. x = 








a x x-x 








- X 




X 








Fig. 18. — Graphical representation of the domain P(x,y) < exp(— 1) from equation A14. The domain is 
circular for p — 0, and elongated for p 7^ 0. In this example, a x > <r y and p > 0. Ai and A2 are the maximal 
and minimal radii of the ellipse. 



Equation A10 is a 2D normal law : 

P(x, y) = exp 

where 



2(1 -P 2 ) 



P = 



X 2 2pXY Y 2 



<j x <j. 



2a 2 



(A14) 



(A15) 
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W^W) (A16) 



W^7) (A17) 

A graphical representation of P(x,y) is shown in figure 18. In the coordinate system (x',y') which axis 
are aligned with the long and short axis of the ellipse P(x,y) = exp(-l), 

P(^,')=ex P (-i^ + |4)) (A18) 

where 



2 + CTy 2 ~ ~ Oy 2 ) 2 + 4 p 2 (J x 2 Oy 2 

A 2 = (AzU) 

From equations A4.A18, A19 and A20, 

s = yV + A 2 2 ( A21 ) 

S = y/er x 2 +<T v 2 (A22) 

S can now be used as a quantitative measure of the ability of a WFS to sense a sine wave phase aberration 
in the pupil plane. For most WFSs, S is a function of 9, in which case I consider T, max , the maximum value 
of S over all values of 9. Since the measurement errors are produced by photon noise, 

^ - ^ (A23) 



fNph 



where N p h is the total number of photons available for wavefront sensing, and (3 is a function of the WFS. 
The parameter /3 P represents the sensitivity of the WFS to photon noise for the spatial frequency considered. 



B. Algebraic representation of WFSs 

In this appendix, I show that any wavefront sensor can be represented as a unitary matrix U and a 
stochastic matrix S c . 

I denote W{u) the complex amplitude of the incoming wavefront at the position u in the pupil. 

W(u) = A(u) x e l *W (Bl) 

where A(u) is the amplitude and 4>{u) the phase of the wavefront. In all WFSs, the incoming wavefront is 
estimated from measurements of the intensities (square of the modulus of the complex amplitude) obtained 
by mutual interferences of parts of the incomping wavefronts. One such intensity, Ik can be written as 

r 2 

h = \B k \ 2 = | J fk(u)W(u)du\ (B2) 

3> 
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where fk (u) is a complex function of u and 



Vk,\f k (u)\ 2 < 1 



(B3) 



The number of such measurements is to. Another constraint is that the incoming light is shared between 
the different outputs : 



re3>^£|/ fe (5)| 2 = l. 

k=l 

Finally, the light intensity is conserved between the input (wavefront) and the outputs : 

m 

\/A,J2h= A(u) 2 dr 



(B4) 



(B5) 



fe=i 



The continuous incoming wavefront can be approximated by its values on the points of a fine 2D grid 
: this representation is accurate up to the spatial frequency defined by the spacing of the grid elements. 
In this representation, the incoming wavefront is a vector A (representing W in equation Bl) which has as 
many elements A k as there are evaluation points on the pupil. I denote n the number of such elements. 
Each function fk(u) is represented by a vector Uk of n elements U k , I = 1 • • • n. The measured intensities are 
a vector J of to elements which is the square of the amplitude of the vector B (elements B k ) : 



h 

h 



B l 

B 2 



Br, 



\B\ 



(B6) 



with 



B = UA = 




ul ■ 
ul ■ 


■ t/r 

• u? 




. ul 


ul ■ 


■ u™ 



A! 

A 2 



A n 



(B7) 



The conservation of total light intensity requires 

VA,\\UA\\ 



(B8) 



Setting to = n, U is therefore a unitary matrix. In all current WFSs, to is in fact infinite and the measured 
quantity is not directly /, but a set of intensities obtained by redistributing the values of I among a smaller 
number of variables, which can be represented by a stochastic matrix S c preserving the total flux. For 
example, in a SHWFS, the intensity measured by a pixel is in fact the integral of the intensity accross the 
pixel. 
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